Mathematical Models of Overparameterized Neural Networks
Authors
Abstract
Deep learning has achieved considerable empirical success in recent years. However, while practitioners have discovered many ad hoc tricks, until recently there has been a lack of theoretical understanding of the tricks invented in the deep learning literature. Practitioners have long known that overparameterized neural networks are easy to learn, and the past few years have seen important theoretical developments in the analysis of overparameterized neural networks. In particular, it was shown that such systems behave like convex systems under various restricted settings, such as for two-layer neural networks (NNs), and when learning is restricted locally to the so-called neural tangent kernel space around specialized initializations. This paper discusses some of these recent advances, which have led to a significantly better understanding of neural networks. We will focus on the analysis of two-layer neural networks and explain the key mathematical models, along with their algorithmic implications. We will then discuss challenges in understanding deep neural networks and some current research directions.
Similar resources
Mathematical Aspects of Neural Networks
In this tutorial paper about mathematical aspects of neural networks, we will focus on two directions: on the one hand, we will motivate standard mathematical questions and well studied theory of classical neural models used in machine learning. On the other hand, we collect some recent theoretical results (as of beginning of 2003) in the respective areas. Thereby, we follow the dichotomy offer...
Mathematical Models of Multiservice Networks
This paper describes several simple models that have helped our understanding of communication networks, and describes some of the new problems that arise in connection with the multiservice networks planned for the future.
Comparison of Artificial Neural Networks and Cox Regression Models in Prediction of Kidney Transplant Survival
The Cox regression model is a statistical method for analyzing survival data that requires assumptions such as hazard proportionality. In recent decades, artificial neural network models have been increasingly applied to predict survival data. This research was conducted to compare Cox regression and artificial neural network models in prediction of kidney transplant survival. The prese...
Asymptotic Theory of Overparameterized
A theory of overparameterized structural models is presented. In such a model some "redundant" parameters are involved; the parameter vector is not identified, and the information matrix is not nonsingular. The minimum discrepancy function (MDF) test statistic is shown to have an asymptotic chi-squared distribution almost everywhere for a wide class of discrepancy functions. Asymptotic distribu...
Mathematical Modeling of Artificial Neural Networks
Models and algorithms have been designed to mimic information processing and knowledge acquisition of the human brain generically called artificial or formal neural networks (ANNs), parallel distributed processing (PDP), neuromorphic or connectionist models. The term network is common today: computer networks exist, communications are referred to as networking, corporations and markets are stru...
Journal
Journal title: Proceedings of the IEEE
Year: 2021
ISSN: 1558-2256, 0018-9219
DOI: https://doi.org/10.1109/jproc.2020.3048020